Taxonomy vs Phylogeny 


Taxonomy was the traditional way that biologists recognised individual species 
(Genus species binomial) 


It was the tool that allowed identification of species in a broader hierarchy 
— Family, Order, Class, Phylum, Kingdom, etc. 


It was generally considered to reflect the phylogenetic history of species. 
— But it didn't really matter if it didn't … it was a tool to identify species. 


Phylogeny 


Phylogeny is the actual evolutionary relationship 
of species. 

A phylogeny has more information (detail) of 
relationships than taxonomy. 

A phylogeny was traditionally based an 
morphology, embryology, even physiology. 
Now advances in molecular biology (gene 
sequencing) have enormously contributed to 
“modern” phylogenies. 

We can NEVER know the REAL phylogeny — we 
can only infer ("educated guess”) it! 


COMPARING SPECIES: 


Phylogeny and Taxonomy 
provide a framework to compare species 


The goal is to understand the selective pressures 
that have led to animal traits. How do we know? 


Convergence (of dissimilar organisms to a common 


environment) 
Divergence (of similar organisms into new 
environments) 


Relatedness (all organisms are 
phylogenetically related) 


Comparative Animal Physiology 
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How do we compare species? 
It is easy to do a 2-species comparison — is it wrong? No! 


The standard approach to doing a two-species 
comparison is to do a 

test: 

In comparing two birds, a desert and a mesic 
species, the means of a trait for the two 
species can be tested to see if they are 
significantly different. 


two group comparisons. 
Here, there isa difference between the two 
species. 


But what was the question? 
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So what is the problem with 2-species comparisons? 


The problem is intent/statistical: We are not isolating the treatment (environmental 
water levels) we are interested in. Any differences we find could be attributed to any 
difference between the species or their environment. 


— Replication? Control? Randomisation? (how could these be used to improve the 
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Are you convinced this is a desert adaptation? 


Consider this example: Is cocoon formation related an adaptation to living in the desert? 
You decide to measure cocoon formation for 5 desert and 5 mesic frog species; You 
hypothesise cocooning (v) to be a desert adaptation and non-cocooning (x) to be a 
mesic adaptation 


Youfind: — Desert species Mesic species 
vv v vw x ox x x x 


CT caaea 


Simple statistics: What is the change that al 10 predictions come out as expected? > 
P=0.510=0.000977 (one-tailed) or P=0.5"0=0.00195 (vald) 


Are you still convinced this is an adaptation? 


You realise that: 


-all of the desert species are Neobatrachus 
- all of the mesic species are Heleioporus 


Neobatrachus species — Heleioporus species 
vvvvv 


> 


Simple statistics: What is the change that al 10 predictions come out as expected? > 
P=0.5°2=0.25 (one-tailed) or P=0.51=0.5 (wo-aled) 1! 


The problem with relatedness of species. 


Are these 5 species of Neobatrachus independent of each other? 
Are these 5 species of Heleioporus independent of each other? 

No! Closely-related species are likely to be less independent than distantly-related 
species. 


Species are NOT independent data points 


Conventional Statistics: ANOVA and REGRESSION 


There are two conventional strategies that are often used for analysing multiple species 
data: 


> analysis of variance (ANOVA) 
> linear regression (OLSR) 
Which one we use depends on whether we have measured one or two numerical traits: 


> if we measure one numerical trait (e.g. metabolic rate), and one categorical factor 
(e.g. arid, mesic, polar), then we analyse the data by ANOVA. 


> If we measure two numerical traits (e.g. metabolic rate and body mass), then we 
analyse the data by linear regression. 


But these ignore relatedness, or phylogeny! — Just like the simple stats I showed 
earlier. 


One trait - ANOVA single factor (species) 


‘There is a significant effect of species on metabolic rate 
= but what about desert versus mesic differences? 


two factor ANOVA (species and habitat) 


It doesn't work! 


Not all combinations of the two factors are present 
e.g. species A is only desert 


How do we compare multiple species by REGRESSION? 


Two numeric traits 
— Regression 
We ignore phylogeny for now 


Regression analysis 


We use regression here to see if there is a linear effect of the X variable on the Y 
variable. If so, then we need to remove this effect to compare species. 


Motabote rate 


‘There is a significant mass effect between the 4 species 
(but there might be no discernable mass effect within each species). 


We should account for body mass when comparing these species. 


Regression analysis 


Might there be an mesic-arid difference? 


Is mesic still lower, if we account for the mass effect? 


Metabo rate 


Regression residuals are really useful here 


We can calculate residuals (how far a point is above or below predicted) 
from the regression line for each point. 
Residuals remove any effect of X (e.g. mass) on Y (e.g. metabolic rate). 


Metabotc rate 


Mass igl 


Desert-Mesic effect on regression residuals? 


If we compare the residuals (by ANOVA), there are clearly no differences between arid 


and mesic species 
(after accountng for thE SEE FEET 


Metsbose rate 


A different example of regression residuals 
in metabolic rate vs mass 


In this example, the residuals clearly differ for the various species, showing that 
residuals can be a useful tool for multiple species comparisons (abit like ANOVA). 


Metabote rate 


(in this case, the overall regression has doubtful meaning) 


We have been assuming “independence” of traits 


for multiple species 


Consider this: 
when analysing traits, an important question is whether the 
trait can be considered to have evolved “independently” in 
species (not whether the species are phylogenetically 
”independent") 


For example, with these species of birds (@) and mammals, 
if metabolic rate is “independent” of phylogeny then we can 
consider the DATA for these species to be independent 


~ In this case, the trait DATA are a “star” phylogeny 
THIS MAY OR MAY NOT BE TRUE FOR DIFFERENT TRAITS 


Assumed 


HYLOGENY”| 


[1] 


AE 


KUE 


il 


So our data is NEVER independent! 


What is te soktion? 


The statistical issue when analysing multiple species is the 
question: are the trait values for different species 


Clearly, © are probably NOT phylogenetically "independent" 
‘© and æ might be phylogenetically “independent” 


Independence is critical 


Often, PHYLOGENY must be properly accounted for to reveal any real relationships 
‘Treating dependent species as independent is a form of “pseudo-replication” 
2 cocoon-forming Neobatrachus and 2 non-cocooning Heleioporus 
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Four independent species nn OF two independent samples? 


.. if the species are phylogenetically “independent” 


then the 4 species might conform 


to the same common regression 
relationship and can be considered 


to be independent 


no phylogenetic effect 


.. if the species are not phylogenetically “independent” 


closely related pairs might conform 
to the same common slope but have 
an offset 
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Walmer et ai chapter 1 Fig. 1.3 


Some METHODS for phylogenetic analysis 


Taxonomic base: 

+ analysis ot covariance 

ts PGLS is one of the most commonly used 
+ nested analysis of variance methods and alma us 10 separate the 
‘tects plogen fom te fet of 


Phylogenetic base: ‘ther at in question 
+ independent contrasts 
+ autoregression 
+ PvR 
+ pais 


